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Abstract 

Normal variance-mean mixtures encompass a large family of useful distributions such as 
the generalized hyperbolic distribution, which itself includes the Student t, Laplace, hyper- 
bolic, normal inverse Gaussian, and variance gamma distributions as special cases. We study 
shape properties of normal variance-mean mixtures, in both the univariate and multivariate 
cases, and determine conditions for unimodality and log-concavity of the density functions. 
This leads to a short proof of the unimodality of all generalized hyperbolic densities. We 
also interpret such results in practical terms and discuss discrete analogues. 

Keywords: convex contours; distribution theory; generalized inverse Gaussian distri- 
bution; log-convexity; modified Bessel function; normal mixture; Poisson mixture; total 
positivity. 
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1 Introduction 

A univariate normal variance-mean mixture (Barndorff-Nielsen, Kent and S0rensen 1982) is the 
distribution of 

Y = fi + ^X + aVxZ (1) 

where X and Z are independent scalar random variables, Z ~ N(0,1), X has a density (the 
mixing density) supported on (0,oo), and — oo < fi, 13 < oo, a > are constants. Equivalently, 
a normal variance-mean mixture is the distribution of a Brownian motion with drift stopped 
at an independent random time. Normal variance-mean mixtures encompass a large family of 
distributions commonly used in many applied fields. A prominent example is the generalized 
hyperbolic (GH) distribution (BarndorfF-Nielsen, 1977). This distribution has an unnormalized 
density of the form (y G R) 

gh(y; /i, A, a, /3, 6) oc e^(^-^) {S^ + {y - '^^'"^'^^^ K,_,/, (^a,/6^TW^) , (2) 

where Ki,{z) denotes the modified Bessel function of the second kind (Abramowitz and Stegun 
1972). The allowable parameter values are /i G R and 

S>0, |/3| < a, if A > 0; 
(5 > 0, \f3\ < a, if A = 0; 
S>0, |/3| < a, if A < 0. 
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The density ([2]) arises as the density of y in ([T]) if we let cr = 1 and let the density of X be the 
generalized inverse Gaussian (GIG; J0rgensen 1982) 



X' 



,A-1 



exp --[xx ^ +iJx) 



x > 0. 



(3) 



Parameters other than the common A are related by x = and = — fi"^ . 

The density ([2]) includes the Student t, Laplace, hyperbolic, normal inverse Gaussian, and 
variance gamma densities as special cases. These are important in modeling financial data; 
see, e.g., Eberlein and Keller (1995), Barndorff-Nielsen (1997), Eberlein (2001), and Chen, 
Hardle and Jeong (2004). The appearance of modified Bessel functions, however, makes them 
nontrivial for both theoretical analysis and numerical computation. We refer to Protassov (2004) 
for computational strategies in parameter estimation. 

This paper is motivated by a basic property of the GH density, namely its unimodality. A 
function f{x) on R is unimodal, if there exists x,, G R such that f{x) increases on (— oo, x*] and 
decreases on [x*,oo). Besides being inherently interesting, unimodality results are also useful. 
Chebyshev's bounds for tail probabilities can be sharpened considerably if the distribution is 
known to be unimodal (Sellke and Sellke 1997). Shape properties such as unimodality are also 
important in random variate generation (Devroye 1986). 

All GH densities are unimodal. Although this can be verified analytically in some cases, it 
is certainly not obvious from the formula ([2]). The only general proof of unimodality we know is 
based on two deep results: (i) GH distributions are self-decomposable (Halgreen 1979; Shanbhag 
and Sreehari 1979; Sato 2001) and (ii) self-decomposable distributions are unimodal (Yamazato 
1978; Steutel and van Harn 2003). Conceivably there may exist a direct proof based on modified 
Bessel functions inequalities (see Yu 2011 for a similar situation). In this paper, however, we 
take another approach and obtain a unimodality condition for a general normal variance-mean 
mixture. The GH case follows as a corollary. 

Our main result (Theorem[T]) says that unimodality is preserved by forming univariate normal 
variance-mean mixtures. The same holds for log-concavity. A nonnegative function /(x), x € R, 
is log-concave if 



Log-concavity implies unimodality and is often referred to as "strong unimodality" (Dharmad- 
hikari and Joag-dev, 1988). We also consider log-convexity. A positive function / on an interval 
/ is log-convex if log / is convex on /. 

Theorem 1. Suppose X and Y are related by (C]) with densities g and f respectively, 
i If g is unimodal, then so is f. 

ii If g decreases on (0, oo), or f3 = 0, then the only mode of f is at fi. 
Hi If g is log-concave, then so is f. 

iv If g is log-convex on (0,oo), then f is log-convex on each of {—oo, fj,) and (/i,oo). 

Theorem [T] has clear practical implications. According to part (i), one must incorporate a 
multimodal mixing distribution in order for a normal variance-mean mixture model to capture 



f{ex+{l-e)y)>[f{x)f[f{y)] 



for all x,y G R, 6* G (0,1). 
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multimodality in the data. This contrasts with other shape properties; skewness, for example, 
depends on the parameter (3 and is not entirely inherited from the mixing distribution. On the 
other hand, log-concave densities have no heavier than exponential tails. Part (iii) therefore 
indicates that normal variance-mean mixtures inherit their heavy tails from the mixing distri- 
butions. That is, one needs to choose a heavy-tailed mixing distribution in ([1]) in order to model 
heavy tails in the data. 

In the case of the GH distribution. Theorem [1] specializes to Corollary [TJ We obtain a short 
proof of unimodality and a simple criterion for log-concavity for GH densities. 

Corollary 1. Let f{y) denote the GH density given by (0). 

i The density f{y) is unimodal. 

a The mode of f{y) is at fi iff either /3 = or 5 = 0, 0<A<1. 

iii The density f{y) is log-concave iff X>1. 

iv The density f{y) is log-convex on each of {—oo,fi) and (fi, oo) iff 5 = and < A < 1. 

Theorem [1] is related to the following result for normal mean mixtures (Dharmadhikari and 
Joag-dev, 1988; Bertin and Theodorescu, 1995). Suppose Y\X ~ N(X, a^) and X has density 
g. Then, denoting the marginal density of Y by /, we have (i) if g is unimodal then so is /; 
(ii) if g is log-concave then so is /. While these two hold by basic convolution properties of 
log-concave functions, the proof of Theorem [1] is more complicated. A key tool is the theory of 
total positivity (Karlin 1968); see Section 3 for details. 

We also consider the multivariate case. A p-variate normal variance-mean mixture is the 
distribution of 

Y = ^i + /3X + VXAZ (4) 

where fi, j3 are p x 1 vectors, ^4 is a full-rank py.p matrix. A" is a scalar random variable supported 
on (0, oo), and Z is a p x 1 standard normal vector independent of X. Owing to properties of the 
multivariate normal, such mixtures are closed under marginalization and linear transformations. 
They have the following feature regardless of the shape of the mixing density (|| • || denotes the 
Euclidean norm). 

Proposition 1. Suppose 13 ^ in Then for each t £ H, the density ofY, denoted by f{y), 
has ellipsoidal contours on the hyperplane 

nt = ^^ + /3t + ^zeW : z'^{AA^)-^/3 = o} . 

Specifically, f{y) depends only on \\A''^{y — jJ- — f3t)\\ for y € T-Lt- 

For example, if A is the identity matrix, then f{y) is spherically symmetric on any hyperplane 
orthogonal to (3 (one may say f{y) has "spindle- like" contours, j3 being the direction of the 
center) . 

Theorem [2] extends Theorem [1] to the multivariate case. The notion of log-concavity is 
trivially extended from the univariate case. Unimodality is more delicate, however, and there 
exist several alternative definitions (Dharmadhikari and Joag-dev 1988). Theorem [2] uses a 
rather weak form of unimodality. 
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Theorem 2. Suppose X and Y are related by ^ with densities g{x) and f{y) on x £ (0, oo) 
and y £ 'RP respectively. Let g*{x) = x~^P^^^^'^g{x). 

i If g*{x) is unimodal, then f{y) has only one local maximum, which lies on the line y = 
H + (3t, t e R. 

a If g*{x) decreases on (0,oo), or /3 = 0, then the only local maximum of f{y) is at y = fj,. 
Hi If g*{x) is log-concave, then so is f{y). 

iv If g*{x) is log-convex on (0,oo), then f{p + bt) is log-convex on t £ (0,oo) for all nonzero 
b e RP. 

An important case is the multivariate generalized hyperbolic (MGH) distribution, which 
arises when X in @ has a GIG distribution. The MGH distribution is further closed under 
conditioning, that is, if y = (1^1,12)^ has an MGH distribution then so does Yi conditional on 
Y2. This allows us to strengthen the unimodality part of Theorem [2j We say a function /(y) on 
RP has convex contours, if the set {y G TV : f{y)> c} is convex for all c G R. Obviously, all 
local maxima of a density having convex contours must be global maxima. 

Corollary 2. Suppose the density of X in 1^ is given by and let f{y) denote the density 
ofY. 

i The density f[y) has convex contours. 

a The mode of f{y) is at fi iff (3 = or x = 0, < X < {p + 1) /2. 
Hi The density f[y) is log-concave iff \> {p + l)/2. 

iv The function /(^ + bt) is log-convex on t £ (0, 00) for all nonzero b G R^ iff X = ^ ^.''^d 
< A < {p+l)/2. 

Theorem [2] and Corollary [2] are proved by reducing the problem to a univariate one and then 
applying Theorem [TJ see Section 3. We discuss discrete analogues in Section 2. 

2 Discrete Analogues 

There exist analogous results for mixtures of discrete distributions (Karlin 1968; Bertin and 
Theodorescu 1995). In the univariate case, discrete unimodality and log-concavity are defined 
in obvious parallel with their continuous counterparts. We mention the following result for 
Poisson mixtures. 

Theorem 3. Let X be a positive random variable with Lebesgue density g{x). Given X, let Y 
have a Poisson(X) distribution, and denote the marginal probability mass function ofY by f{y). 
If g{x) is unimodal on x £ (0, 00) then so is f{y) on y = 0,1, .. .. The same holds if "unimodal" 
is replaced by "decreasing, " "log-concave, " or "log-convex". 

The unimodality part of Theorem [3] is a classical result of Holgate (1970); see also Bertin 
and Theodorescu (1995). The decreasing and log-convexity parts are obtained by Steutel and 
van Harn (2003). All four parts can be proved using the same approach as in our proof of 
Theorem [TJ 
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The practical implications of Theorem [3] are similar to those of Theorem [TJ Roughly, when 
fitting a Poisson mixture model, one needs to choose a multi-modal (respectively, heavy-tailed) 
mixing distribution in order to capture multi-modality (respectively, heavy tails) in the data. 
Note that "heavy" means "heavier than the exponential" . 

Sichel (1982) studies a generalized inverse Gaussian Poisson (GIG-P) distribution in an 
economics context. This is a Poisson mixture as in Theorem [3] where the mixing density g{x) is 
a generalized inverse Gaussian ([3]). When x = the resulting mixture is a negative binomial. 
Corollary [3] states unimodality and log-concavity results analogous to those of Corollary [H We 
omit the parallel derivations. 

Corollary 3. In the setting of Theorem let g(x) be the GIG density given by 
i The GIG-P probability mass function, f{y), is unimodal. 
a The function f{y) is decreasing iff f{0) > /(I)- 
Hi The function f{y) is log-concave iff X> 1. 

The analogy is not complete, however. Characterizing log-convexity for the GlG-P family 
seems to be an open problem, for which Theorem [3] only yields a trivial sufficient condition. 

3 Derivation of Main Results 

For convenience, we say that a function h{x) has property S on an interval /, if h{x) has at most 
two sign changes on x £ I and, in the case of two changes, the sign pattern is —,+,—. (The 
number of sign changes is counted discarding zero terms.) Lemma [T] characterizes unimodal- 
ity (respectively, log-concavity) in terms of how h{x) crosses all horizontal lines (respectively, 
exponential curves). 

Lemma 1. A function h{x) is unimodal on an interval I iff h{x) — c has property S on I for 
all c G R; a nonnegative h{x) is log-concave on I iff h{x) — ae^^ has property S on I for all 
a > 0, 6 G R. 

A kernel k{x,y) on x € / and y G J {I, J are intervals) is totally positive (TP) if for each 
m > 1 and all xi < X2 < • • • < Xm, Vi < y2 < • • • < Vm-, Xi G /, yj G J, the determinant 
of the m X m matrix with k{xi,yj) as the {i,j) entry is nonnegative. The kernel k{x,y) is 
strictly totally positive (STP) if such determinants are strictly positive. We rely on a variation- 
diminishing property of TP kernels; see Karlin (1968) for the general theory. 

Lemma 2. Suppose k{x,y) is a TP kernel on x £ I, y £ J, and h(x) is a function that has n 
sign changes on I. Then, assuming the integral is absolutely convergent, q{y) = Jj k{x, y)h{x) 6.x 
has at most n sign changes on y £ J. Moreover, if q{y) has exactly n sign changes, then its 
sign pattern as y increases on J is the same as that of h{x) as x increases on I. In particular, 
if h{x) has property S on I then q{y) has property S on J. 

The idea of using Lemmas [U and [2] to establish unimodality, log-concavity or log-convexity 
comes from Propositions 3.1 and 3.2 in Chapter 1 of Karlin (1968). Karlin's results concern 
monotonicity and concavity, but the method works for unimodality, log-concavity and log- 
convexity after some modifications. 
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Hx,y) = i — exp 



Proof of Theorem [II If /3 = then y in ([T|) is unimodal with a unique mode at y = /i for 
arbitrary g. Hence parts (i)-(ii) hold. Parts (iii)-(iv) wih follow from the /3 7^ case by taking 
limits. Let us consider /3 7^ 0. After a linear transformation we may assume = 0, /3 > and 
(7 = 1. The density of Y becomes 

POO 

f{y)=i k{x,y)9{x)dx, -00 < y < 00, 
Jo 

2x 

Two properties of the kernel k{x, y) are easily established: 

(a) k{x,y) is STP in x > 0, y > 0; 

(b) we have the identity 

POO ^ 

/ Hx,y)dx = -, y>0. (5) 
Jo P 

Part (a) is obvious after writing k{x,y) = u{x)v{y) exp [— y^/(2x)] (see Karlin 1968, p. 18). Part 
(b) is a calculus exercise. Alternatively, (b) holds because the GIG density ([3|) with A = 1/2 
integrates to one. 

If g{x) is unimodal, then by Lemma[TJ g{x) — cf3 has property on x € (0, 00) for arbitrary 
c. By ([5]) we have 

/•oo 

f{y) - c = / k{x, y) [g{x) - c/3] dx, y > 0. (6) 



By Lemma [21 f{y) — c has property S on y £ [0,oo). We conclude that f{y) is unimodal on 
y £ [0, 00) since c is arbitrary. For y < we may differentiate inside the integral and directly 
obtain 

f'iy) = /" f /3 - -) H^, y)9{x) dx > 0. (7) 

Jo ^ ^' 

Thus /(y) is unimodal with a nonnegative mode overall. By invoking strict total positivity, we 
can show that the mode is unique (see Karlin 1968, Chapter 5, Theorem 3.1). This proves part 
(i)- 

Similarly, if g(x) is decreasing, then Lemma [2] and ([6]) show that /(y) — c changes signs at 
most once, from + to — , on y E [0, 00). As c G R is arbitrary, this means /(y) decreases on 
[0,00), which implies that the mode is at y = 0. This proves part (ii). 

For parts (iii)-(iv) let us define 

Ky) = log /(y) - log c - (/3 - 7)y 
for c > and 7 G R. As long as 7 7^ 0, we have the following generalization of ([6]) 

/(y) - cexp(/3y - |7y|) = / A:(x, y)/i(x) dx, y G R; (8) 

Jo 

'/32-72 



h{x) = g{x) — c|7| exp 

Suppose y(x) is log-concave. By Lemma [H h(x) has property 5 on x G (0,cxd). By Lemma [21 
the right hand side of ([8|) has property 5 on y G [0,cxd). 
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Suppose 7 > 0. Then l{y) has property S on [0,oo) because it has the same sign pattern as 
the left hand side of (j8]). For y < we apply ([7]) to obtain 

ny) = Q^-/3+7>^>o. 

fiy) 

If the sign pattern of l{y) on [0, oo) is — , +, — , then l{y) < for y < because of monotonicity. 
Hence the overall sign pattern of l{y) on (— oo,c«) is still —,+,—. Other cases are similar. It 
follows that l{y) has property S on y £ (—00,00). 

Suppose 7 < 0. We can similarly show that l{y) decreases on y G [0, 00) and has property S 
on y £ (—00, 0]. It follows that l{y) has property S for arbitrary c > and 7 G R. By LemmalU 
f(y) is log-concave, which proves part (iii). 

Finally, suppose g{x) is log-convex on (0, 00). Parallel to part (iii), we can show that if 7 > 0, 
then —l{y) has property S on y £ (0, 00). If 7 < then —l{y) increases on (0, 00). Hence —l{y) 
has property S on y £ (0,oo) for arbitrary 7 G R. It follows that f{y) is log-convex on (0,oo). 
The (—00,0) case is similar. Hence part (iv) holds. □ 

Remark. In Theorem [H under the stronger assumption that x~^^'^g{x) is log-concave, we 
have an alternative proof of the log-concavity of f{y)- The key is Prekopa's theorem, which 
states that if a joint Lebesgue density is log-concave then so are the marginals. The density of 
{X,Y) is 



u{x, y) (X X ^^'^g{x) exp 



{y - Px 
2x 



,2 



X > 0, 2/ G R. 



It is easy to verify that (y — /3x)^/(2x) is jointly convex in x > 0, y G R. Hence u{x,y) is 
jointly log-concave. By Prekopa's theorem, /(y) = u{x,y)dx is log-concave in y G R. This 
argument, however, does not work under the weaker assumption that g{x) is log-concave. See 
Yu (2009) for a similar situation. 

To prove Corollary [H the following facts concerning modified Bessel functions are used (see 
Abramowitz and Stegun 1972, Chapter 9). We have 



K,{z) = K^,{z); (9) 
lim z^K^iz) = l^-^riu), iy>0; (10) 

Kiz) = --K,{z) - K,^i{z). (11) 

z 

Proof of Corollaryl^ Let g{x) denote the GIG density given by da]) with X = ^ = - Z?^- 
Then g{x) is unimodal, and so is /(y) by Theorem [TJ Hence part (i) holds. For parts (ii)-(iv) 
we assume ^ = 0, /3 > 0. If 5 = 0, < A < 1, then the mixing density g{x) is decreasing, and 
by Theorem [H /(y) has a unique mode at 0. Conversely, if /3 > and S > or j3 > 0, 6 = 
and A > 1, then direct calculation using 1^ and (©-([U]) yields /'(0)//(0) = /3 > 0. Hence the 
mode is strictly positive. This proves part (ii). If A > 1, then the GIG density is log-concave, 
and so is /(y) by Theorem [H If A < 1 then /(y) cannot be log-concave because as y — )• ±00, it 
behaves like to a multiplicative constant (Barndorff-Nielsen and Blaesild 1981). 

This proves part (iii). Finally, if 5 = and < A < 1, then g{x) is log-convex. By Theorem [H 
/(y) is log-convex on each of (0,oo) and (— oo,0). Conversely, if /(y) is log-convex on each of 
(0,00) and (—00,0), then it increases on (— oo,0) and decreases on (0, 00), because /(y) — )• 
as |y| —7- 00. By part (ii), we must have either /3 = Oor5 = 0<A<l. If/3 = 0, 5 > or 
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p = S = 0, A > 1, then by P and ©-([n]), we have /'(O) = 0. By log-convexity f'{y) > 
for y > 0, which is again a contradiction. Therefore we must have 6 = and < A < 1. This 
proves part (iv). □ 

The rest of this section proves Proposition [H Theorem [5] and Corollary [21 Let us henceforth 
assume // is all zero and A is the identity matrix in the general results follow after a linear 
transformation. The density of Y is 



where g{x) again denotes the mixing density. 



2x 



dx, 



(12) 



Proof of Proposition [21 For y £ Tit we can write y = (3t + z such that = 0. Then (I12p leads 
to 

roo 

{2TTxyP/^g{x)exp 



f{y) 



2(t - x)2 + \\zf 



2x 



dx. 



(13) 



Hence /(y) depends only on ||z|| and in fact decreases in ||2;||, yielding spherical contours on each 
Tit (a linear transformation makes them ellipsoidal). □ 

Proof of Theorem\^ If /3 is all zero, then f{y) is symmetric with a single mode at zero for 
arbitrary g. Let us assume /3 / and consider f{y) on the hyperplane lit as in (fT3|) . Evidently, 
if G is a local mode, and y^, = I3t + z^,, then ||z*|| = by (fTS]) . Thus all local modes of /(y) 
lie on the line y = I3t, t E R. We have 



fm 



{2ttx) 



9{x) exp 



2x 



dx. 



If 5*(2;) = x~^P~^^/^g((x) is unimodal, then by Theorem [H f{/3t) is unimodal in t G R with a 
unique mode. (Although Theorem [His stated in terms of a proper mixing density, we note here 
that whether g*{x)dx < oo is not essential; we only need f{(3t) < oo for t / 0, which is 
easily verified.) That is, /(y) has a single local maximum on the line y = f3t, t G R. This proves 
part (i). Moreover, if g*{x) is decreasing, then by Theorem [H the mode of f{l3t) is at t = 0. 
Thus part (ii) holds. 

For part (iii), let us consider /(a + bt) as a function of t G R for a, 5 G R^ such that \\h\\ = 1. 
After some algebra, (fT2]) yields 



exp 



6^/3) t 



/(a + bt) = C 



-1/2 



where C is a constant not depending on t, and 

g{x) = y*(x)exp 



g{x) exp 



2x 



{t + a^h 



2x 



dx. 



(14) 



(15) 



li g*{x) is log-concave on (0,oo), then so is g{x) because \aJh\ < ||a||||6|| = ||a||. By Theorem[Tl 
the right hand side of (jl4p is log-concave in t G R, and so is f{a + bt). The claim holds since 
a, b are arbitrary. 

Finally, suppose g*{x) is log-convex. Let us consider a = in (fT^ and (fT5]l . Then g{x) is 
log-convex. By Theorem [T] the right hand side of (jl4p is log-convex in t G (0, oo), and so is 
f{bt). This proves part (iv). □ 
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Proof of Corollary\^ Let a,b £ RP such that b ^ 0. Then /(a + bt) is proportional to a 
univariate GH density due to closure properties of the MGH density over linear transformations 
and conditioning. By Corollary [H f{a + bt) is unimodal in i G R, which implies convex contours 
as a, b are arbitrary. This proves part (i). Proofs of the other parts parallel those of Corollary [T] 
Consider part (iii) for example. If A > (p + l)/2, then 

2.-{p-l)/2^(^) is log-concave, and by 
Theorem m so is f{y). Conversely, if f{y) is log-concave, then by considering /(a + bt) with 
fixed a, b, Corollary [1] yields A > (p + l)/2. Hence the log-concavity claim holds. □ 
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